HANSENULA POLYMORPHA YAPSIN DEFICIENT MUTANT STRAIN AND
PROCESS FOR THE PREPARATION OF RECOMBINANT PROTEINS USING THE SAME



Technical Field



The present invention relates to a process for producing a recombinant protein
secreted and expressed from the methanol-utilizing yeast Hansenula polymorpha at a high
yield by efficiently preventing decomposition of the recombinant protein.

More preferably, it relates to a process for efficiently producing a recombinant
protein by disrupting the yapsin1 gene of an H. polymorpha strain to prevent decomposition
of a protein containing a basic or dibasic amino acid residue produced in H. polymorpha.

More particularly, it relates to a process for producing a recombinant protein in
an intact form at a high yield in H. polymorpha by preventing decomposition of
a recombinant protein having a basic or dibasic amino acid residue, such
as human parathyroid hormone, human serum albumin and serum albumin fusion protein,
which comprises cloning the HpYPS1 gene encoding Hansenula polymorpha aspartic
protease type yapsin1, preparing an H. polymorpha strain with a defective HpYPS1 gene
using the cloned gene, and culturing a transformant derived from the strain as a host.

Background Art

Recently, as demand for high-purity protein medicaments has sharply increased
owing to the rising incidence of incurable diseases and improved public medical standards,
the relative importance of medicinal recombinant proteins in the health-related
bioengineering field has grown considerably.

Accordingly, yeast, a unicellular eukaryotic microorganism, is increasingly used
as a host system for mass production of recombinant proteins. In particular, since the
protein secretion pathway of yeast is very similar to that of higher animal cells, yeast
is commonly used as a microbial host system for production of human-derived secretory
proteins. Also, since most yeasts normally secrete very few proteins out of the cell,
recombinant proteins secreted from yeast can advantageously be recovered and purified
with ease. Recently, mass production of serum proteins, vaccines and various other
important medical proteins has been successfully carried out using non-traditional yeasts,
including Hansenula polymorpha and Pichia pastoris, in addition to the traditional yeast
Saccharomyces cerevisiae (Gellissen G., Appl. Microbiol. Biotechnol. 54, 741 (2000)).

As a eukaryotic microorganism, yeast secretes proteins by substantially the same
mechanism as mammalian cells and involves similar protein modification and cleavage
steps. A protein entering the secretion pathway acquires its final three-dimensional
structure in the endoplasmic reticulum. In the case of a glycoprotein, N- and O-linked
sugar chains are attached thereto. Subsequently, the protein is transferred to the
Golgi apparatus, in which it is further subjected to protein modification steps
such as trimming of oligosaccharides or protein cleavage, and thereafter is transferred to
different organelles, inserted into components of the cell membrane or secreted out of
the cell.
As described above, since protein secretion in yeast involves various kinds of
post-translational modification processes, the secretion and production of a foreign
protein in yeast may cause many problems. In particular, when a recombinant protein is
secreted and produced in yeast, it is necessary to use an efficient expression and
secretion system in order to increase productivity, but it is also important to prevent
decomposition of the produced and secreted foreign protein. If a recombinant yeast is
cultured for a long period of time at a high cell concentration in a fermenter, proteases
that are naturally secreted from the host cell, or that are intracellular and released
through cell lysis, enter the medium and degrade the produced recombinant proteins,
thereby reducing the overall productivity of the recombinant proteins. To solve this
problem, various protease-deficient strains have been developed for yeasts used as
recombinant protein expression systems, including Saccharomyces cerevisiae, Hansenula
polymorpha, Pichia pastoris and the like. Primarily, strains have been developed in which
the PEP4, PRB1 or CPY genes encoding degradative enzymes present in the yeast vacuole
are disrupted (Alvarez et al., J. Biotechnol. 38, 81 (1994); Chen et al., Curr. Genet. 27,
201 (1995); Gleeson et al., Methods Mol. Biol. 103, 81 (1998); Kang et al., In Hansenula
polymorpha (ed. Gellissen G.) p. 124 (2001)). In addition to strains deficient in vacuolar
degradative enzymes, kex1Δ strains have been developed, in which the KEX1 gene encoding
carboxypeptidase α present in the Golgi apparatus is disrupted. With kex1Δ strains,
C-terminal degradation of hirudin in Saccharomyces cerevisiae (Hinnen et al., In Gene
expression in recombinant microorganisms (ed. Smith A.), p. 121 (1995)), of human
epidermal growth factor in Hansenula polymorpha (Heo et al., Protein Expr. Purif. 24, 117
(2001)) and of rodent or human endostatin in Pichia pastoris (Boehm et al., Yeast 15,
563-567 (1999)) can be significantly reduced.

Recently, yapsins, a family of yeast aspartic proteases that recognize and cleave
at single or paired basic amino acids, have been identified in Saccharomyces cerevisiae
as novel proteases located in the cell membrane (Egel-Mitani et al., Yeast 6, 127-137
(1990)). Yapsin1 (previously known as yeast aspartic protease 3 (YAP3)) was the first of
these proteases to be reported, followed by yapsin2 (previously known as MKC7) (Komano
and Fuller, Proc. Natl. Acad. Sci. USA 92, 10752-10756 (1995)). From the recently
disclosed Saccharomyces cerevisiae genome information, genes encoding at least five
yapsin-type proteases presumed to have similar functions, including the additional
yapsin3, yapsin6 and yapsin7, have been reported to exist (Olsen et al., Biochem. J.
339, 407-411 (1999)). Though the physiological functions of these yapsins have not been
clearly established, a growing number of studies report that target recombinant proteins
intended for secretion and production in S. cerevisiae are cleaved by the protease
activity of yapsins. Yapsin-deficient yeast strains are therefore attracting attention as
useful strains for production of recombinant proteins, particularly foreign peptides
having basic amino acids. Recombinant proteins reported to date to be cleaved by yapsins
during secretion and production in S. cerevisiae include human serum albumin
(Kerry-Williams et al., Yeast 14, 161-169 (1998)), human parathyroid hormone (Kang et al.,
Appl. Microbiol. Biotechnol. 50, 187-192 (1998); Korean Patent Registration No. 0246932
(published on December 8, 1999)), insect diuretic hormone (Copley et al., Biochem. J. 330,
1333-1340 (1998)), glucagon and glucagon-like peptide (Egel-Mitani et al., Enzyme Microb.
Technol. 26, 671-677 (2000); U.S. Pat. No. 6,110,703) and human elafin precursor
(Bourbonnais et al., Protein Expr. Purif. 20, 485 (2000)). Meanwhile, considering that
hPTH is still considerably degraded at the late stage of fermenter cultivation of the
YPS1-deficient S. cerevisiae strain, the present inventors have developed an S. cerevisiae
yapsin multiple-deficient mutant strain (yps1Δyps2Δyps3Δ), in which the YPS2 and YPS3
genes coding for yapsin2 and yapsin3 are also removed. As a result, 90% or more of the
degradation of human parathyroid hormone observed in high-concentration cultivation was
prevented (Korean Patent Application No. 2000-51267 and International Application No.
PCT/KR01/01447).

H. polymorpha, one of the methanol-utilizing yeasts, is in the spotlight as a very
useful yeast host for mass production of recombinant proteins, since strong and
controllable promoters have been developed for it, as for Pichia pastoris, and a foreign
gene can be introduced into the host chromosome in multiple copies (Faber et al., Yeast
11, 1331 (1995)). To date, various kinds of foreign proteins have been expressed, and the
expression levels have often exceeded 1 g/L in high-concentration cultivation using
a fermenter. In particular, it has been reported that when recombinant phytase is secreted
and produced, the expression level is about 13.5 g/L (Mayer et al., Biotechnol. Bioeng. 63,
373-381 (1999)). Therefore, the H. polymorpha expression system stands out as one of the
most promising among the presently available eukaryotic expression systems. In
particular, since some of the recombinant proteins produced early on in H. polymorpha
have already passed clinical trials and are on the market (e.g., hepatitis B vaccine) or
in the product development phase (e.g., hirudin), H. polymorpha is considered a suitable
expression system for production of recombinant proteins to be developed as medicaments
(Gellissen G., Appl. Microbiol. Biotechnol. 54, 741-750 (2000)). Also, with the recent
arrival of the post-genome era, there is an increased need for a high-efficiency
expression system for functional analysis of novel genes, and thus an expression system
using H. polymorpha is expected to play a major part in functional and structural analysis
of novel proteins as well as in mass production of useful proteins derived from higher
eukaryotic cells.

Disclosure of Invention 


Therefore, the present invention has been made to develop a high-efficiency H.
polymorpha expression system by solving the problems related to undesired cleavage and
decomposition of a recombinant protein by a yapsin-type protease. It is an object of
the present invention to provide a technology for secreting and producing a recombinant
protein in an intact form at a high efficiency by preventing cleavage and
decomposition of the recombinant protein expressed in H. polymorpha, in which the
HpYPS1 gene encoding H. polymorpha yapsin1 is cloned, the cloned gene is used to
prepare an HpYPS1 gene-deficient mutant strain, and the prepared mutant strain is used as
a recombinant protein expression host.

In accordance with one aspect, the present invention provides a gene sequence
encoding yapsin1 (HpYPS1) of an H. polymorpha strain, obtained by probing the
Hansenula polymorpha strain for yapsin protease genes as identified in S. cerevisiae.

In a further aspect, the present invention provides a yapsin1 polypeptide
having Hansenula polymorpha strain-derived aspartic protease activity.

In another aspect, the present invention provides a secretion signal gene
sequence and a peptide sequence of the HpYPS1 polypeptide for secretion of a foreign
protein recombinantly produced in a Hansenula polymorpha strain.

According to yet another aspect, in order to develop a Hansenula polymorpha
strain capable of secreting and producing a recombinant protein in its original
form at a high efficiency, the present invention provides a process for
secretion and production of a recombinant protein at a high efficiency in Hansenula
polymorpha by cloning the above-described Hansenula polymorpha yapsin protease gene




WO 2004/003204 PCT/KR2003/001279 

and deleting the cloned gene to minimize the decomposition of the recombinant protein in 
Hansenula polymorpha. 

In order to accomplish the above objects, the present inventors have cloned the
HpYPS1 gene encoding Hansenula polymorpha yapsin1, subjected the gene to functional
analysis, and developed a Hansenula polymorpha mutant strain (hpyps1Δ) in which the
HpYPS1 gene is deleted. Consequently, we have established a
method for increasing the productivity of a recombinant protein by using the Hansenula
polymorpha mutant strain as a host strain to express a protein containing a basic or
dibasic amino acid residue, including human parathyroid hormone, human
serum albumin and human serum albumin fusion protein, thereby significantly reducing
the decomposition of the recombinant protein expressed and secreted in H. polymorpha.

Brief Description of Drawings 

Further objects and advantages of the invention can be more fully understood 
from the following detailed description taken in conjunction with the accompanying 
drawings in which: 

Fig. 1 shows the nucleotide sequence of the HpYPS1 gene encoding H. polymorpha
yapsin1 and the expected amino acid sequence (①: a signal sequence, ②: the expected
GPI anchor, ③: a hydrophobic amino acid Golgi membrane-anchoring domain);

Fig. 2 shows the comparison of the amino acid sequences of H. polymorpha
yapsin1 and S. cerevisiae yapsin-type proteases;

Fig. 3 is the result of the functional complementation experiment showing that the
temperature sensitivity of the Saccharomyces cerevisiae yapsin multiple-deficient mutant
strain is rescued when the Hansenula polymorpha yapsin1 gene HpYPS1 is expressed;

Fig. 4 is a schematic view showing the procedures for preparing the Hansenula
polymorpha yapsin1 gene-disrupted mutant strain hpyps1Δ;

Fig. 5 shows the results of comparison of protease activity in the culture
supernatants of the H. polymorpha wild type and the yapsin1-deficient hpyps1Δ mutant
strain, in which human parathyroid hormone is used as a substrate;

Fig. 6 shows a schematic view of the procedures to construct the human
parathyroid hormone expression vector pMOXhPTH for H. polymorpha;

Fig. 7A shows the result of Southern blotting to examine the expression vector
insertion sites and the number of copies, using chromosomal DNA isolated from the
Hansenula polymorpha wild type and the mutant strain hpyps1Δ transformed with the vector
pMOXhPTH;

Fig. 7B shows the results of SDS-polyacrylamide electrophoresis of yeast
culture supernatants, followed by staining, to compare the expression and decomposition
of recombinant human parathyroid hormone in the transformants of the
H. polymorpha wild type and the hpyps1Δ mutant strain;

Fig. 8A shows the results of comparison of the expression and decomposition
of recombinant human serum albumin when the H. polymorpha wild type strain
and the HpYPS1 gene-deleted hpyps1Δ mutant strain are used as a host, in which yeast
culture supernatants are analyzed by SDS-polyacrylamide electrophoresis, followed by
staining;

Fig. 8B shows the results of comparison of the expression and decomposition
of recombinant human serum albumin when the H. polymorpha wild type strain
and the HpYPS1 gene-deleted hpyps1Δ mutant strain are used as a host, in which yeast
culture supernatants are analyzed by Western blotting;

Fig. 9 shows a schematic view of the procedures for constructing the albumin-TIMP2
fusion protein expression vector pYHSA13-T2 for Hansenula polymorpha; and

Fig. 10 shows the results of the comparison of the expression and degradation
of recombinant albumin-TIMP2 fusion protein when the H. polymorpha wild type
strain and the HpYPS1 gene-deleted hpyps1Δ mutant strain are used as a host, in which
yeast culture supernatants are analyzed by Western blotting. Lane 1 represents the
wild type transformant transformed with pYHSA12(+), Lanes 2 to 5 represent the
wild type transformant transformed with pYHSA13-TIMP2, Lanes 6 to 9 represent
the hpyps1Δ mutant strain transformed with pYHSA13-TIMP2 and Lane 10 represents
isolated and purified albumin (200 ng).

Best Mode for Carrying Out the Invention 

H. polymorpha is a yeast that can use methanol as a carbon source
and energy source. According to the present invention, there is provided an H.
polymorpha mutant strain in which a yapsin enzyme is disrupted, based on the fact that
when H. polymorpha is used in a method for producing a foreign protein, the yield of the
foreign protein is reduced since the produced foreign protein is decomposed by the yapsin
enzyme, which is an aspartic protease present in the yeast.

As described herein, the yapsin enzyme family, which is a sub-family of aspartic
proteases, specifically cleaves a protein at basic or dibasic amino acid residues.
This distinguishes it from other aspartic proteases, which cleave at hydrophobic residues.
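
As a rough illustration of the substrate specificity described above, the following
Python sketch lists the single and paired basic residues (lysine K, arginine R) in a
peptide, which are the positions a yapsin-type protease could in principle recognize.
The peptide string and the function name are hypothetical, and the scan only locates
residues; it does not predict whether cleavage actually occurs.

```python
BASIC = set("KR")  # lysine and arginine, the basic residues considered here


def basic_sites(peptide):
    """Return (monobasic, dibasic) 1-based positions in a peptide.

    A dibasic site is reported as the position of the first residue of two
    adjacent basic residues; a monobasic site is a basic residue with no
    basic neighbour on either side.
    """
    seq = peptide.upper()
    mono, di = [], []
    for i, aa in enumerate(seq):
        if aa not in BASIC:
            continue
        prev_basic = i > 0 and seq[i - 1] in BASIC
        next_basic = i + 1 < len(seq) and seq[i + 1] in BASIC
        if next_basic:
            di.append(i + 1)
        elif not prev_basic:
            mono.append(i + 1)
    return mono, di


# Hypothetical peptide used only to exercise the function.
example = "ASVKRELQDKG"
mono, di = basic_sites(example)
print("monobasic positions:", mono)
print("dibasic positions:", di)
```

A list of this kind only indicates which recombinant proteins contain candidate
sites and may therefore benefit from a yapsin-deficient host.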

The present inventors obtained, by PCR amplification from chromosomal DNA of
H. polymorpha, a segment showing similarity to the S. cerevisiae yapsin1-encoding gene
(YPS1) and studied the functions of the segment.

As a result, the segment was identified as the HpYPS1 gene of H. polymorpha.
Further, we prepared a Hansenula polymorpha mutant strain in which the HpYPS1
gene is disrupted and confirmed that a foreign protein can be produced from the strain at
a high yield. Thus, the present invention has been completed.

In one aspect, the present invention provides a nucleic acid molecule comprising
the sequence shown in Fig. 1.

More particularly, the present invention provides a nucleic acid molecule
comprising the sequence encoding Hansenula polymorpha yapsin1 shown in Fig. 1.

In another aspect, the present invention provides a polypeptide comprising the
amino acid sequence shown in Fig. 1.

More particularly, the present invention provides a polypeptide comprising the
amino acid sequence of Hansenula polymorpha yapsin1 (HpYPS1) shown in Fig. 1.

In another aspect, the present invention provides a secretion signal gene
sequence and a peptide sequence of the HpYPS1 polypeptide, shown as ① in Fig. 1, for
secretion of a foreign protein recombinantly produced in a Hansenula polymorpha strain.

In another aspect, the present invention provides an H. polymorpha mutant strain
having reduced yapsin activity owing to mutation of the HpYPS1 gene encoding H.
polymorpha yapsin1.

In another aspect, the present invention provides a recombinant H. polymorpha
strain expressing a foreign protein, obtained by introducing a gene encoding the foreign
protein into the H. polymorpha mutant strain.

In another aspect, the present invention provides a process for preparing a
foreign protein comprising culturing the recombinant H. polymorpha strain under
conditions to express the foreign protein and isolating the foreign protein from the
culture.

More particularly, the present invention provides a process for secreting and
producing a recombinant protein in Hansenula polymorpha using, as a host, a yeast strain
lacking the HpYPS1 gene for a yapsin-type protease. The process comprises
the steps of: cloning the HpYPS1 gene encoding the H. polymorpha protease yapsin1;
functionally analyzing the resulting H. polymorpha HpYPS1 gene by a temperature
sensitivity complementation experiment in a Saccharomyces cerevisiae yapsin multiple-
deficient mutant strain; preparing an hpyps1Δ mutant strain in which the HpYPS1 gene is
disrupted in Hansenula polymorpha, followed by analysis of yapsin activity; and
preparing, using the H. polymorpha mutant strain hpyps1Δ as a host, a recombinant
Hansenula polymorpha strain hpyps1Δ-pMOXhPTH to express and secrete human parathyroid
hormone, a recombinant Hansenula polymorpha strain hpyps1Δ-pYHSA12 to express and
secrete human serum albumin, or a recombinant Hansenula polymorpha strain
hpyps1Δ-pYHSA13-TIMP2 to express and secrete a TIMP2 protein fused with human serum
albumin, followed by analysis of the decomposition level of the recombinant protein.
Therefore, the present invention relates to a method for
efficiently producing a recombinant protein by preventing degradation of the recombinant
protein expressed from the methanol-utilizing yeast H. polymorpha, which comprises cloning
the HpYPS1 gene encoding the H. polymorpha aspartic protease type yapsin, preparing an H.
polymorpha strain deficient in the HpYPS1 gene using the cloned gene, and culturing a
transformant derived from the strain as a host to minimize degradation of the
recombinant protein, thereby producing the recombinant protein in an intact form
at a high yield.

The sequence of the H. polymorpha YPS1 gene (HpYPS1) (SEQ ID NO: 1)
cloned according to the present invention was deposited in GenBank under Accession No.
AF493990. Also, the HpYPS1 gene encoding the H. polymorpha protease yapsin1 was
deposited in an international depository authority (Korean Collection for Type Cultures)
on June 18, 2002 and assigned Accession No. KCTC 10285BP. Further, the
hpyps1Δ mutant strain in which the HpYPS1 gene of H. polymorpha is disrupted was
deposited in an international depository authority (Korean Collection for Type Cultures)
on June 18, 2002 and assigned Accession No. KCTC 10281BP. In an
embodiment, the recombinant H. polymorpha strain hpyps1Δ-pMOXhPTH expressing
and secreting human parathyroid hormone was deposited in an international depository
authority (Korean Collection for Type Cultures) on June 18, 2002 and assigned
Accession No. KCTC 10282BP. In another embodiment, the recombinant H.
polymorpha strain hpyps1Δ-pYHSA12 expressing and secreting human serum albumin
was also deposited in an international depository authority (Korean Collection for Type
Cultures) on June 18, 2002 and assigned Accession No. KCTC 10283BP. In
addition, the recombinant H. polymorpha strain hpyps1Δ-pYHSA13-T2 expressing and
secreting albumin-TIMP2 fusion protein was deposited in an international depository
authority on June 23, 2003 and assigned Accession No. KCTC 10485BP.

The present invention will now be explained in detail by the following examples.
However, it should be understood that the following examples are only for illustrative
purposes and are not intended to limit the scope of the invention.



<Example 1> 

Cloning of Hansenula polymorpha yapsin1 gene HpYPS1 and analysis of the sequence

In order to find a gene encoding yapsin1 in H. polymorpha, a pair of synthetic
oligonucleotides (5'-GAAGTGCAGCAGCAGCTCCTGAACC-3', SEQ ID NO: 3;
5'-GGCTGATGACGGCTCGGTCACGATGG-3', SEQ ID NO: 4) was prepared on the
basis of information on H. polymorpha random sequenced tags described in Blandin et al.
(FEBS Lett. 487, 76 (2000)). By PCR (polymerase chain reaction) using the prepared
oligonucleotides as primers, a 0.88 kb DNA segment was amplified from chromosomal DNA
of H. polymorpha DL-1L. Then, Southern blotting was conducted using the
amplified DNA segment as a probe. Based on the result of the Southern blot, 3.5 kb
HindIII DNA segments were extracted from H. polymorpha chromosomal DNA to prepare
a genomic library, which was transformed into E. coli. A DNA segment reactive with the
DNA probe was isolated by colony PCR and subjected to DNA sequencing to identify a
DNA segment comprising an ORF (open reading frame) of 1728 bp showing a
high similarity to the S. cerevisiae YPS1 gene (Fig. 1). The H.
polymorpha YPS1 gene (HpYPS1) product has a signal sequence at amino acids 1-17 of
its N-terminus and a region at amino acids 556-575 presumed to be a domain that can be
anchored in the membrane of the Golgi apparatus, as reported for S. cerevisiae yapsin1.

It also has structural features by which a glycosylphosphatidylinositol anchor
can be attached thereto, as in S. cerevisiae yapsins (Fig. 1). H. polymorpha yapsin1
(SEQ ID NO: 2) shows a homology of 36% and a similarity of 52% with S.
cerevisiae yapsin1 and a homology of 30% or more with other yapsin proteases (Fig. 2,
Table 1).
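
The lengths quoted above are mutually consistent, which a short calculation makes
explicit. The sketch assumes, as is conventional but not stated in the text, that the
1728 bp ORF includes a terminal stop codon; the variable and function names are
illustrative only.

```python
ORF_BP = 1728               # ORF length reported for HpYPS1
SIGNAL = (1, 17)            # signal sequence, residue range quoted in the text
ANCHOR_DOMAIN = (556, 575)  # putative membrane-anchoring region, from the text

codons = ORF_BP // 3
residues = codons - 1       # assumption: the final codon is a stop codon


def span_len(residue_range):
    """Number of residues in an inclusive (start, end) range."""
    start, end = residue_range
    return end - start + 1


print("codons:", codons)
print("encoded residues:", residues)
print("signal sequence length:", span_len(SIGNAL))
print("anchoring domain length:", span_len(ANCHOR_DOMAIN))
print("anchoring domain ends at C-terminus:", ANCHOR_DOMAIN[1] == residues)
```

Under that assumption the putative anchoring domain occupies the last 20 residues of
the predicted polypeptide.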

Table 1

Homology and similarity of HpYPS1 and ScYPS genes

          ScYPS1      ScYPS2      ScYPS3      ScYPS6      ScYPS7
HpYPS1    36% (52%)   31% (49%)   30% (44%)   26% (44%)   29% (34%)

Numbers in parentheses are similarity.
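
For readers unfamiliar with how such percentages arise, the sketch below computes
identity and similarity over a pairwise alignment. It is not the method used to
produce Table 1, which the text does not specify; the groups of similar residues, the
choice of denominator (aligned columns without gaps) and the toy sequences are all
assumptions made for illustration.

```python
# Assumed groups of chemically similar residues (illustrative only, not a
# standard scoring matrix).
GROUPS = ["ILVM", "FWY", "KRH", "DE", "ST", "NQ", "AG"]
GROUP_OF = {aa: i for i, group in enumerate(GROUPS) for aa in group}


def identity_similarity(aln_a, aln_b):
    """Percent identity and similarity for two aligned, equal-length strings.

    Columns with a gap ('-') in either sequence are skipped. Similarity
    counts identical residues plus substitutions within the same group.
    """
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have equal length")
    compared = identical = similar = 0
    for a, b in zip(aln_a.upper(), aln_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            identical += 1
            similar += 1
        elif a in GROUP_OF and GROUP_OF[a] == GROUP_OF.get(b):
            similar += 1
    if compared == 0:
        return 0.0, 0.0
    return 100.0 * identical / compared, 100.0 * similar / compared


# Hypothetical aligned fragments, not taken from HpYPS1 or ScYPS1.
ident, simil = identity_similarity("MKLD-STAG", "MRIDQSSCG")
print(f"identity {ident:.1f}%, similarity {simil:.1f}%")
```

Because similarity includes identical positions, it is never lower than identity,
which matches the pattern of the paired values in Table 1.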



<Example 2> 

Functional analysis of the Hansenula polymorpha HpYPS1 gene

In the case of the traditional yeast Saccharomyces cerevisiae, it has been reported
that a YPS1/YPS2 double-deficient mutant strain derived from the S. cerevisiae W303 strain
as a parental strain is so temperature-sensitive that it cannot grow at 37°C
(Komano and Fuller, Proc. Natl. Acad. Sci. USA 92, 10752-10756 (1995)). Also, the
SUll8 (yps1Δ/yps2Δ/yps3Δ) strain (Korean Patent Registration No. 10-0386836),
a yapsin multiple-deficient mutant strain in which the three genes YPS1, YPS2 and YPS3
are disrupted, prepared by the present inventors using Saccharomyces cerevisiae L3262 as
a parental strain, was found to be sensitive to high temperature. As a
method for functional analysis of the Hansenula polymorpha yapsin1 gene HpYPS1, a
functional complementation experiment was conducted by transforming the HpYPS1
gene into the above-described Saccharomyces cerevisiae multiple yapsin mutant strain
according to the lithium chloride-DMSO method (Hill et al., Nucleic Acids Res. 19, 5791
(1991)) to examine whether expression of the Hansenula polymorpha yapsin1 gene could
rescue the temperature sensitivity of the Saccharomyces cerevisiae multiple yapsin mutant
strains. In order to express the HpYPS1 gene in Saccharomyces cerevisiae, a 3.2 kb DNA
segment including the HpYPS1 gene was inserted into the Saccharomyces cerevisiae
multicopy vector YEp352 (Hill et al., Yeast 2, 163 (1986)) to generate YEp-HpYPS1.

As shown in Fig. 3, the Saccharomyces cerevisiae yapsin multiple-deficient
mutant strains transformed with the YEp352-HpYPS1 vector containing the HpYPS1 gene
grew as well as the wild type strain at 37°C, while the yapsin multiple-deficient
mutant strain transformed with the YEp352 vector alone, used as a control, did not
grow at 37°C. These results show that the temperature sensitivity caused by
the multiple deletion of Saccharomyces cerevisiae yapsin genes can be overcome by
expression of the Hansenula polymorpha yapsin1 gene. This provides a basis
for concluding that the H. polymorpha gene HpYPS1 cloned according to the present
invention is a functional homologue of the yapsin genes of S. cerevisiae.

<Example 3> 

Preparation of the HpYPS1 gene-deficient strain and analysis of yapsin activity

Using the HpYPS1 gene prepared above, an H. polymorpha yapsin-deficient
mutant strain hpyps1Δ was constructed and assayed for change in its yapsin-mediated
protein degradation activity. In order to prepare a mutant strain in which the HpYPS1
gene encoding Hansenula polymorpha yapsin1 is deleted, fusion PCR (Oldenburg
et al., Nucleic Acids Res. 25, 451 (1997)) was conducted on H. polymorpha chromosomal
DNA using the primers described in Table 2. The resulting DNA segments were transformed
into the Hansenula polymorpha DL1-LdU (leu2Δ ura3::lacZ; Kang et al., In Hansenula
polymorpha: Biology and Application (Ed. G. Gellissen), pp. 124 (2002)) strain to induce
in vivo homologous recombination, thereby deleting the gene. More
specifically, a primary PCR was first conducted to obtain the N-terminal and C-terminal
fragments of the HpYPS1 gene and of the URA3 gene. Then, a secondary fusion PCR was
conducted to fuse the N-terminal and C-terminal fragments of the HpYPS1 gene with the
N-terminal and C-terminal fragments of the URA3 gene, respectively. The two DNA segments
thus obtained were introduced into yeast cells.

Using URA3 as a selection marker, viable transformants were selected on
minimal medium lacking uracil. Construction of the hpyps1Δ strain (leu2Δ ura3::lacZ
yps1::URA3), an H. polymorpha mutant strain in which the HpYPS1 gene is deleted,
was confirmed by PCR, which generated DNA segments of different sizes from
the wild type strain and the mutant strain (Fig. 4).

To compare the yapsin activity of the H. polymorpha hpyps1Δ mutant
strain prepared above with that of the wild type strain, each of the two strains was
cultured for 10 hours in YPD medium (1% yeast extract, 2% peptone, 2% glucose), and
the resulting culture supernatant was analyzed for its protein degradation activity
using human parathyroid hormone (hPTH) as a substrate. 20 µl of the culture
supernatant, diluted 1/4, was mixed with hPTH (about 1.6 µg). The reaction
mixture was incubated for 2 hours, 4 hours and 6 hours at 37°C. Each
product was loaded on an SDS-polyacrylamide gel to analyze the level of hPTH degradation
by the yapsin activity.
As shown in Fig. 5, with the culture supernatant of the wild type, hardly any
hPTH remained after 2 hours of incubation, while with the
culture supernatant of the hpyps1Δ mutant strain, a substantial amount of hPTH was
observed, though the amount decreased somewhat as the reaction time became longer.

This suggests that the culture supernatant of hpyps1Δ, the Hansenula
polymorpha HpYPS1-deficient mutant strain, had significantly reduced hPTH
degradation activity compared to the culture supernatant of the wild type strain. This
result agrees with the previous study performed on the S. cerevisiae YPS1-
deficient mutant strain (Kang et al., Appl. Microbiol. Biotechnol. 50, 187 (1998)) and thus
supports the conclusion that HpYPS1 is a gene encoding a yapsin protease of H. polymorpha
and that the HpYPS1-deficient mutant strain developed according to the present invention
has reduced yapsin activity.



Table 2

Primers used in fusion PCR for disruption of the HpYPS1 gene

Primer        Sequence                                              SEQ ID NO:
YPS(NF)       5'-GGACACGCAAGAGGTGTCTG-3'                            5
YPS(NR+rp)    5'-AGCTCGCTACCCGGGGATCCGCAACTTTCATTGTGTCAAC-3'        6
YPS(CF+rp)    5'-GCACATCCCCCTTTCGCCAGCCTCTTCGGTGCGGTTGACC-3'        7
YPS(CR)       5'-GCTCGGCTCCAGGATTCAGG-3'                            8
URA3 N-S      5'-GGATCCCCGGGTACCGAGCT-3'                            9
[illegible]   5'-CACCGGTAGCTAATGATCCC-3'                            10
URA3 C-S      5'-CGAACATCCAAGTGGGCCGA-3'                            11
URA3 C-A      5'-CTGGCGAAAGGGGGATGTGC-3'                            12
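
The fusion step relies on the chimeric HpYPS1 primers carrying 5' tails that are
complementary to the ends of the URA3 fragments. The following sketch checks that
property for the sequences as printed in Table 2 (SEQ ID NOs: 6, 7, 9 and 12). The
20-nucleotide tail length and the helper names are assumptions made for illustration.
As printed, the tail of SEQ ID NO: 7 is the exact reverse complement of SEQ ID NO: 12,
whereas the tail of SEQ ID NO: 6 differs from the reverse complement of SEQ ID NO: 9
at one position, which may reflect a transcription error in the table rather than the
actual primer.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def mismatches(a, b):
    """Number of differing positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


# Sequences keyed by SEQ ID NO, copied as printed in Table 2.
PRIMERS = {
    6: "AGCTCGCTACCCGGGGATCCGCAACTTTCATTGTGTCAAC",
    7: "GCACATCCCCCTTTCGCCAGCCTCTTCGGTGCGGTTGACC",
    9: "GGATCCCCGGGTACCGAGCT",
    12: "CTGGCGAAAGGGGGATGTGC",
}
TAIL = 20  # assumed length of the 5' tail on the chimeric primers

for chimeric, ura3 in ((6, 9), (7, 12)):
    tail = PRIMERS[chimeric][:TAIL]
    n = mismatches(tail, revcomp(PRIMERS[ura3]))
    print(f"SEQ ID NO: {chimeric} tail vs revcomp of SEQ ID NO: {ura3}: "
          f"{n} mismatch(es)")
```

A tail that is the reverse complement of the URA3 primer gives the two primary PCR
products a shared end, which is what allows them to anneal and be extended into a
single fragment in the secondary PCR.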



<Example 4> 

Construction of the recombinant H. polymorpha strain expressing hPTH and analysis of 
the hPTH expression 

In order to express and secrete human parathyroid hormone (hPTH) in
H. polymorpha, pMOXhPTH, a human parathyroid hormone expression vector for H.
polymorpha, was prepared according to the method shown in Fig. 6. Specifically, the
0.53 kb EcoRI/SalI segment containing the hPTH cDNA fused with the MFα signal sequence
was prepared from pG10-hPTH, an hPTH expression vector for S. cerevisiae (Chung et
al., Biotechnol. Bioeng. 57, 245 (1998)), and the 7.8 kb XbaI/EcoRI segment from which
the albumin cDNA had been removed was prepared from pYHSA12, a human serum albumin
expression vector for H. polymorpha (Kang et al., Biotechnol. Bioeng. 76, 175 (2001)).

The two DNA segments were joined to generate pMOXhPTH. The resulting
vector pMOXhPTH was introduced into the H. polymorpha hpyps1Δ mutant strain and
the wild type strain, and viable transformants were selected in minimal medium lacking
leucine. To stabilize the Leu+ transformants, they were transferred to fresh liquid
minimal selective medium lacking leucine every 24 hours after inoculation, five times
in total (Sohn et al., Appl Microbiol Biotechnol. 51, 800, (1999)). The culture
broth of the Leu+ transformants obtained as described above was plated onto minimal
media containing G418 at various concentrations and cultured at 37°C. DNA was
isolated from each of the resulting colonies and subjected to Southern blot analysis using
the 1.5 kb H. polymorpha LEU2 gene as a hybridization probe according to the method
described by Sambrook et al. (Molecular Cloning, Cold Spring Harbor Laboratory Press,
1989) to confirm insertion of the expression vector into the chromosome (Fig. 7A). The H.
polymorpha LEU2 gene used as a probe was labeled with digoxigenin using
a non-radioactive DNA labeling and detection kit. As expected, in the
transformants selected on the medium containing a high G418 concentration, multiple
integration of the vector pMOXhPTH was observed. Upon comparison of the intensity
of the LEU2 gene signal on the chromosome with that of the LEU2 gene signal on the
inserted vector, it was presumed that about 5 to 6 copies at most had been inserted.
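The copy-number estimate described above amounts to a ratio of band intensities. The following is a minimal sketch of that arithmetic, assuming that the hybridization signal scales linearly with copy number and that the chromosomal LEU2 band corresponds to a single copy. The function name and the intensity values are hypothetical and are not taken from Fig. 7A.

```python
def estimate_copy_number(vector_signal: float,
                         chromosomal_signal: float,
                         chromosomal_copies: int = 1) -> float:
    """Estimate integrated vector copies from Southern blot band intensities.

    Assumes signal is proportional to copy number and that the chromosomal
    LEU2 band represents `chromosomal_copies` copies (1 by default).
    """
    if chromosomal_signal <= 0:
        raise ValueError("chromosomal_signal must be positive")
    return chromosomal_copies * vector_signal / chromosomal_signal


if __name__ == "__main__":
    # Hypothetical densitometry readings in arbitrary units.
    chromosomal_band = 1200.0
    for vector_band in (1300.0, 3500.0, 6600.0):
        copies = estimate_copy_number(vector_band, chromosomal_band)
        print(f"vector band {vector_band:.0f} -> about {copies:.1f} copies")
```

Because film or membrane signals saturate at high intensity, an estimate of this kind is best read as approximate, which matches the cautious wording of the text.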

Some transformants (a, b, c, d, e and f) of the recombinant yeast strains that
had been confirmed to have the hPTH expression vector integrated into the chromosome
were inoculated into YPM medium (1% yeast extract, 2% peptone, 2% methanol) and
cultured at 37°C. Yeast culture supernatants obtained at 12 hours and 24 hours after
initiation of the cultivation were treated with TCA (trichloroacetic acid). The proteins
secreted out of the cells were concentrated to 1/20 of the initial volume and
electrophoresed on a 15% SDS-polyacrylamide gel, followed by staining with Coomassie
Brilliant Blue R-250 (Fig. 7B).

When the hPTH secreted in the H. polymorpha wild type was compared with that
secreted in the hpyps1Δ mutant strain, only a trace of decomposition product (d1) was
observed in the culture supernatant obtained from the wild type, and full-size hPTH
(i) was hardly observed, since most of the hPTH was decomposed. On the other hand,
secretion of full-size hPTH was clearly observed in the culture supernatant
obtained from the hpyps1Δ mutant strain at 12 hours after cultivation. As the
cultivation time became longer, the amount of full-size hPTH decreased in the hpyps1Δ
mutant strain and a band (d2), presumed to be hPTH degraded at the C-terminus, was
observed. Nevertheless, these results show that the hpyps1Δ mutant strain having the
yapsin1 gene destroyed is a more useful host than the wild type for secretory
expression of recombinant hPTH, since the decomposition of the recombinant parathyroid
hormone is considerably inhibited by the reduction of yapsin activity.
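Since the invention concerns proteins containing basic or dibasic amino acid residues, it can be useful to list such positions in a candidate protein before choosing a host. The sketch below is only an illustration: it scans a one-letter amino acid sequence for runs of Lys (K) and Arg (R). The function names are hypothetical, and the test string is a 16-residue stretch of the HpYps1 sequence itself (residues 97 to 112 of SEQ ID NO: 2), chosen only because it appears in this document. Whether yapsin1 actually cleaves at any listed position is not implied.

```python
import re
from typing import List, Tuple


def basic_runs(protein: str) -> List[Tuple[int, str]]:
    """Return (1-based start position, residues) for each run of K/R."""
    return [(m.start() + 1, m.group())
            for m in re.finditer(r"[KR]+", protein.upper())]


def dibasic_sites(protein: str) -> List[Tuple[int, str]]:
    """Return only the runs with two or more consecutive basic residues."""
    return [(pos, run) for pos, run in basic_runs(protein) if len(run) >= 2]


if __name__ == "__main__":
    # Residues 97-112 of SEQ ID NO: 2, written in one-letter code.
    fragment = "NSYCSSSSTKKLKRDG"
    for pos, run in dibasic_sites(fragment):
        print(f"position {pos} in fragment: {run}")
```

Positions are reported relative to the start of the string passed in, so an offset has to be added when a fragment rather than a full-length sequence is scanned.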

<Example 5> 

Analysis of expression and secretion of recombinant human serum albumin in H.
polymorpha HpYPS1 gene deficient mutant strain

Human serum albumin (HSA) expressed in S. cerevisiae is secreted as an
intact form of 67 kDa; however, some recombinant HSA in a decomposed form of 45 kDa
has also been observed. It was reported that degradation of HSA secreted into the yeast
culture supernatant, particularly production of the 45 kDa decomposition product,
was reduced when an S. cerevisiae strain with the yapsin1 gene destroyed was used as
a host (Kerry-Williams et al., Yeast 14, 161, (1998)). In order to analyze the expression
and degradation of recombinant HSA in the H. polymorpha HpYPS1 gene
deficient mutant strain developed according to the present invention, the hpyps1Δ
mutant strain was transformed with the expression vector pYHSA12 (Kang et al., Biotechnol
Bioeng. 76, 175, (2001)), in which the MOX promoter and the HSA cDNA are joined to
a vector for copy-number controlled gene integration that uses the LEU2 gene and the G418
resistance gene as selective markers. The recombinant H. polymorpha wild type strain and
hpyps1Δ mutant strain were cultured in YPGM medium (1% yeast extract, 2% peptone,
1% glycerol, 2% methanol). Each of the culture supernatants thus obtained was loaded
on an SDS-polyacrylamide gel, followed by staining with silver nitrate, or transferred to a
nitrocellulose membrane, followed by Western blotting with an HSA antibody, to examine
the expression and degradation of HSA. As shown in Fig. 8, more HSA was secreted
in the hpyps1Δ mutant strain than in the wild type strain,
particularly after cultivation for 24 hours. Though the overall degradation of albumin
was not significantly inhibited, the 45 kDa degradation product was clearly
reduced in the hpyps1Δ mutant strain as compared to the wild type,
as was observed in S. cerevisiae. Considering that production of the 45 kDa degraded
HSA product presents a more serious problem in high-concentration fermenter cultivation,
it is expected that an albumin production system using the hpyps1Δ mutant strain as a
host can increase production of albumin through a remarkable reduction in degradation of
albumin, as compared to a production system using the wild type strain as a host.

<Example 6>

Analysis of expression and secretion of recombinant albumin-fused TIMP2 (HSA-
TIMP2) in H. polymorpha HpYPS1 gene deficient mutant strain

As one approach to increasing the persistence of protein therapeutic agents in blood,
research on technology that increases the in vivo half-life of a medical protein by
expressing it as a recombinant protein fused to albumin, a highly stable blood
protein, is in the spotlight (Smith et al., Bioconjugate Chem. 12, 750-756, (2001);
Sheffield et al., Blood Coagul Fibrinol. 12, 433-443, (2001)). The present inventors have
also developed a recombinant TIMP-2 with significantly increased in vivo stability by
expressing TIMP-2, which attracts attention as a next-generation anti-angiogenesis
agent and anti-tumor agent, as a recombinant protein fused to albumin in S. cerevisiae
(Korea Patent Registration No. 10-2001-0027823, International Application No.
PCT/KR03/00015). In order to analyze the expression and decomposition of
the recombinant albumin-fused HSA-TIMP2 in the H. polymorpha HpYPS1
gene deficient mutant strain (hpyps1Δ) developed according to the present invention,
pYHSA13-T2, an HSA-TIMP2 expression vector for H. polymorpha, was prepared
according to the method shown in Fig. 9. That is, the DNA fragments encoding HSA
and TIMP2 were prepared using the PCR primers described in Table 3. Fusion PCR
was conducted using the prepared genes in a ratio of 1:1 to prepare the 2.4 kb HSA-
TIMP2 DNA segment comprising HSA (1.8 kb) and TIMP2 (0.586 kb) connected to
each other. The resulting segment was cloned into the pGEM-T vector
(Promega, USA) to prepare pTHSA-TIMP2. The sequence of the HSA-TIMP2 DNA
segment was confirmed by sequencing analysis. Then, pTHSA-TIMP2 was cut with
XbaI and SpeI. The resulting 1.2 kb XbaI/SpeI HSA-TIMP2 gene segment was
joined to the XbaI-digested pYHSA12(+), an H. polymorpha multiple tandem introduction
vector having the albumin gene inserted (Kang et al., Biotechnol Bioeng. 76, 175-185,
(2001)), to prepare pYHSA13-T2 (Fig. 9).

Table 3

PCR primers used in preparation of HSA-TIMP2 fusion gene

Primer      Sequence                                                        Note
HSAEcoRF    5' gaattcatgaagtgggtaaccttt 3' (SEQ ID NO: 13)                  HSA forward direction
Hs-R2       5' taagcctaaggcagcttgac 3' (SEQ ID NO: 14)                      HSA reverse direction
H-T2-F      5' caagctgccttaggcttatgcagctgctccccggtg 3' (SEQ ID NO: 15)      TIMP2 forward direction, used with 18 bp at HSA C-end in fusion PCR
R-T2-Sp     5' actagtgatttatgggtcctcgatg 3' (SEQ ID NO: 16)                 TIMP2 reverse direction

The albumin fusion protein expression vector prepared above was introduced
into H. polymorpha DL1-L (leu2) and the hpyps1Δ (leu2 hpyps1::URA3) mutant strain
to prepare Leu+ transformants. The collected transformants were passage-cultured five
times on selective medium so that the expression vectors could be multiply integrated into
the host chromosomal DNA. Then, the transformants were plated at a density of 1 x 10^5
to 10^6 per plate on media containing the antibiotic G418 at various concentrations to select
transformants having resistance to G418. In order to examine whether the transformants,
which had been confirmed to have the expression vector pYHSA13-T2 integrated into the
host chromosome by Western blotting, secrete and express the fusion protein of the
expected size, the yeast culture supernatants obtained by culturing the transformants for
48 hours in YPM were analyzed by Western blotting using an antibody to albumin. As
shown in Fig. 10, the fusion protein HSA-TIMP2 (88 kDa), which is larger
than HSA (66.5 kDa), was secreted as expected. Interestingly,
in the yapsin deficient mutant strain hpyps1Δ, only the full-size 88 kDa HSA-TIMP2
was observed, without any decomposition product. In the wild type
DL1 strain, however, a band presumed to be a decomposition product was observed just
under the 88 kDa HSA-TIMP2 band. Owing to this reduction of
decomposition products, the expressed amount of the HSA-TIMP2 fusion protein was about
two times higher in the hpyps1Δ mutant strain than in the wild type strain. This suggests
that, in the hpyps1Δ mutant strain, protein degradation by yapsin1 is inhibited in the
secretion of recombinant proteins expressed in albumin-fused form as well as in the
secretion of the above-described recombinant albumin.


Industrial Applicability 



The present invention can be usefully applied in the bioengineering industry to produce
recombinant proteins using H. polymorpha. By using the H. polymorpha strain (hpyps1Δ)
deficient in the protease yapsin1 gene as a recombinant protein-producing host, cleavage
of the recombinant protein by yapsin activity can be remarkably reduced, so that the
recombinant protein is secreted and produced in an intact configuration at a
high yield.

1. A nucleic acid molecule comprising the sequence shown in Fig. 1.

2. The nucleic acid molecule according to claim 1, which is the HpYPS1 gene encoding
Hansenula polymorpha yapsin1 (Accession No. KCTC 10285BP).

3. A polypeptide comprising the amino acid sequence shown in Fig. 1.

4. The polypeptide according to claim 3, which is Hansenula polymorpha yapsin1 which
can cleave a protein comprising a basic or dibasic amino acid residue.

5. The polypeptide according to claim 3, which is a secretion signal peptide of HpYPS1
polypeptide used as a secretion signal of a foreign protein.

6. A Hansenula polymorpha mutant strain having yapsin activity reduced by mutation of
HpYPS1 gene encoding Hansenula polymorpha yapsin1.

7. The Hansenula polymorpha mutant strain according to claim 6 deposited under
Accession No. KCTC 10281BP.

8. A recombinant Hansenula polymorpha strain expressing a foreign protein which is
prepared by introducing a gene encoding the foreign protein to the Hansenula
polymorpha mutant strain according to claim 6.

9. The recombinant Hansenula polymorpha strain according to claim 8, which is
hpyps1Δ-pMOXhPTH (KCTC 10282BP).

10. The recombinant Hansenula polymorpha strain according to claim 8, which is
hpyps1Δ-pYHSA12 (KCTC 10283BP).

11. The recombinant Hansenula polymorpha strain according to claim 8, which is
hpyps1Δ-pYHSA13-TIMP2 (KCTC 10485BP).

12. A process for preparing and isolating a foreign protein comprising expressing the
foreign protein using the Hansenula polymorpha yapsin1 deficient strain according to any
one of claims 8 to 11 as a host.

13. The process according to claim 12, in which the foreign protein is a recombinant
protein containing a basic or dibasic amino acid residue which can be cleaved by yapsin1.

14. The process according to claim 13, in which the protein containing a basic or dibasic
amino acid residue comprises human parathyroid hormone, human serum albumin and
albumin fusion protein.

FIG. 1

[Nucleotide sequence of the HpYPS1 gene region with the deduced amino acid sequence of the coding region (nucleotides 901 to 2628); the same sequences are given as SEQ ID NO: 1 and SEQ ID NO: 2 in the Sequence Listing.]

FIG. 2

FIG. 3

FIG. 7

FIG. 10

Sequence Listing

<110> Korea Research Institute of Bioscience and Biotechnology
      LeadBio, Inc
      Bio Holdings CO., LTD

<120> Hansenula polymorpha yapsin deficient mutant strain and process
      for the preparation of recombinant proteins using the same

<160> 16

<170> KopatentIn 1.71

<210> 1
<211> 3151
<212> DNA
<213> Hansenula polymorpha

<220>
<221> sig_peptide
<222> (901)..(903)
<223> initiation codon

<220>
<221> 5'UTR
<222> (1)..(900)

<220>
<221> 3'UTR
<222> (2622)..(3151)

<220>
<221> terminator
<222> (2629)..(2631)
<223> termination codon

<220>
<221> CDS
<222> (901)..(2628)
<223> coding sequence

WO 2004/003204
PCT/KR2003/001279


<400> 1 

agttgagtcg caatagtgtg gcgaacttca aatgccctta ctgtccgcga acaaccacca 60 

ttgcccaggc tgtgcaggcc agatttgtta atttgtgaaa agtggaaaaa atttattccg 120 

ctatgcctaa ccgaagagcc cgcaagaaga ggcggacaga agacttttcc agctcttcgg 180

catctgaaaa cgatagtgac tccgagagcg tgaccagtgt acaggaagag cagccggatg 240 

cgcccgaaac atacacaata gatggcctgg acacgcaaga ggtgtctgac agcacacagg 300 

tgagactcca acagctgaac gcagacaggt tggccagcat agagcaaagc ctttcaggca 360 

acctcaaact ggacataaac gcagtacgcc agatagatga tgtgcgtgag cagctgcaga 420 

acgagtattt gaagaaattg cttgtcacat attctgagga cctggatgcg ctgcgtcaga 480 

aaaccgattt caaggaaaac tcactcaaaa ccctcgcccg tcttctcaaa gagagcggaa 540

acatatttga tgatggaact ctcaagtcgc tagttgagtg atgtatatga taatgtctaa 600 

ttttaatttt catcagtgtg caagatctgg gcttagccgt tctaaatggt atattcaggc 660 

tgtgcaagcc acatttaaaa ttaccccatc ggtttttaaa ttctattgtt agaaattagg 720 

atctacatag aggtagagtg agcaacagaa cattgtttgc tatccgggcc ctccgactgg 780 

aacgtcttac cttcagctac tatttattca gaaaaaagag tgcattttca tctatcaagg 840 

tctcaaagtg tcgaatcaaa tcactagtat tttttccgag actaaaaaaa agttgacaca 900 

atg aaa gtt gct aca ctg ttt ttc ttg gct tcg agt gtc tgt gtg ctg 948
Met Lys Val Ala Thr Leu Phe Phe Leu Ala Ser Ser Val Cys Val Leu
1 5 10 15

gga gac cca cag ttc gtg aaa ctg gag gcc tct gtt ctt cgg gga tcc 996
Gly Asp Pro Gln Phe Val Lys Leu Glu Ala Ser Val Leu Arg Gly Ser
20 25 30

act tac aag gat tcc cag aag ggg gcc aag ccg ttc atg ttg gaa aag 1044
Thr Tyr Lys Asp Ser Gln Lys Gly Ala Lys Pro Phe Met Leu Glu Lys
35 40 45

agg gct gat gac ggc tcg gtc acg atg gaa ttg cag aac gcc cag tct 1092
Arg Ala Asp Asp Gly Ser Val Thr Met Glu Leu Gln Asn Ala Gln Ser
50 55 60

ttc tac caa gtc gag atc gag ata gga tct gat aag cag aag gtg ggg 1140
Phe Tyr Gln Val Glu Ile Glu Ile Gly Ser Asp Lys Gln Lys Val Gly
65 70 75 80

gtt ttg att gat acc ggt tcc tcg gac ttg tgg gtg atg aac tcg aat 1188
Val Leu Ile Asp Thr Gly Ser Ser Asp Leu Trp Val Met Asn Ser Asn
85 90 95

aac tct tac tgt tcg tct tcc agc act aaa aaa ttg aaa cgg gac gga 1236
Asn Ser Tyr Cys Ser Ser Ser Ser Thr Lys Lys Leu Lys Arg Asp Gly
100 105 110

ccg gcc gat gcg cta caa aaa gga cgc gat ctt tcc gac ctg tac aat 1284
Pro Ala Asp Ala Leu Gln Lys Gly Arg Asp Leu Ser Asp Leu Tyr Asn
115 120 125

ttc aac tct cca aac gaa gac aac aat gca aaa gga ttc ttg ggt ggc 1332
Phe Asn Ser Pro Asn Glu Asp Asn Asn Ala Lys Gly Phe Leu Gly Gly
130 135 140

tgg gga gac ttg acc aca gta gag act gca acc cag gat gag aca cag 1380
Trp Gly Asp Leu Thr Thr Val Glu Thr Ala Thr Gln Asp Glu Thr Gln
145 150 155 160

acg gct ctc gct gcg cag gcc acc gtg gac tgc tcg cta tac gga acg 1428
Thr Ala Leu Ala Ala Gln Ala Thr Val Asp Cys Ser Leu Tyr Gly Thr
165 170 175

ttc aat cct tca acg tcc aat tcg ttc cac aac aac ggc acc aca ttt 1476
Phe Asn Pro Ser Thr Ser Asn Ser Phe His Asn Asn Gly Thr Thr Phe
180 185 190

gag att tcg tac gcg gac cgc act ttt gcc cgt gga acc tgg ggc tac 1524
Glu Ile Ser Tyr Ala Asp Arg Thr Phe Ala Arg Gly Thr Trp Gly Tyr
195 200 205

gat gat gtc act ttc aat ggt gtc acg gtt aac gat ctc tcg ttg gcc 1572
Asp Asp Val Thr Phe Asn Gly Val Thr Val Asn Asp Leu Ser Leu Ala
210 215 220

gtg gca gat gaa aca gat tct tcg act ggt gtt ttt ggt atc gga ttg 1620
Val Ala Asp Glu Thr Asp Ser Ser Thr Gly Val Phe Gly Ile Gly Leu
225 230 235 240

agg gaa ttg gaa acc aca tac tca gga ggc gga cca cag cat tac atc 1668
Arg Glu Leu Glu Thr Thr Tyr Ser Gly Gly Gly Pro Gln His Tyr Ile
245 250 255

tac gac aac tta cct ttc aaa atg gtc gac cag gga ctc atc aat aga 1716
Tyr Asp Asn Leu Pro Phe Lys Met Val Asp Gln Gly Leu Ile Asn Arg
260 265 270

gcc gcc tat tcc gtc tac ctg aac tca act gag tcc agc act gcc tcg 1764
Ala Ala Tyr Ser Val Tyr Leu Asn Ser Thr Glu Ser Ser Thr Ala Ser
275 280 285

atc ctc ttc ggt gcg gtt gac caa agc aaa tat acc gga agt ctt ggc 1812
Ile Leu Phe Gly Ala Val Asp Gln Ser Lys Tyr Thr Gly Ser Leu Gly
290 295 300

ttg ctt cct atc atc aac acg gct gct tcc tac ggt tac caa aag cct 1860
Leu Leu Pro Ile Ile Asn Thr Ala Ala Ser Tyr Gly Tyr Gln Lys Pro
305 310 315 320

cta agg ctc caa atc acc ctg tct gcc att acg gtc agc gac tcc aga 1908
Leu Arg Leu Gln Ile Thr Leu Ser Ala Ile Thr Val Ser Asp Ser Arg
325 330 335

gga cag caa gca agc att ggt tca gga gct gct gct gca ctt ctt gat 1956
Gly Gln Gln Ala Ser Ile Gly Ser Gly Ala Ala Ala Ala Leu Leu Asp
340 345 350

acc gga acg act ttg acg tat gct cca agc gag att gtc gag aaa ctt 2004
Thr Gly Thr Thr Leu Thr Tyr Ala Pro Ser Glu Ile Val Glu Lys Leu
355 360 365

gct gaa acc cta ggc ttc gac tac agc agc tct gtc ggg gcc tac gtg 2052
Ala Glu Thr Leu Gly Phe Asp Tyr Ser Ser Ser Val Gly Ala Tyr Val
370 375 380

gca aga tgc agg gac gtt gat agc tac gct gtc aac ttc gac ttc cag 2100
Ala Arg Cys Arg Asp Val Asp Ser Tyr Ala Val Asn Phe Asp Phe Gln
385 390 395 400

ggt aaa gtg att gaa gct cct ttg agt tcc ttc ctg att gct ctg caa 2148
Gly Lys Val Ile Glu Ala Pro Leu Ser Ser Phe Leu Ile Ala Leu Gln
405 410 415

acc aac tcc gga gaa gtt tcc tcc tac tgc gca ttg ggt att ttc tcc 2196
Thr Asn Ser Gly Glu Val Ser Ser Tyr Cys Ala Leu Gly Ile Phe Ser
420 425 430

tct gga gac gaa tcc ttc acg ctc ggc gat act ttc ctg cga aac gcc 2244
Ser Gly Asp Glu Ser Phe Thr Leu Gly Asp Thr Phe Leu Arg Asn Ala
435 440 445

tac ttt gtg gct gac ctc gag gga tat caa atc gct ata gct aac gtg 2292
Tyr Phe Val Ala Asp Leu Glu Gly Tyr Gln Ile Ala Ile Ala Asn Val
450 455 460

aac ctg aat cct gga gcc gag caa att gag gtc atc tca ggc aac tcc 2340
Asn Leu Asn Pro Gly Ala Glu Gln Ile Glu Val Ile Ser Gly Asn Ser
465 470 475 480

att cct tct gct tcg tcg gtt tcc gat tac tcc aat acc tgg ggc gcc 2388
Ile Pro Ser Ala Ser Ser Val Ser Asp Tyr Ser Asn Thr Trp Gly Ala
485 490 495

tct gcc acc gct ttg gac act gac agg cct act act ctg gga tct gtg 2436
Ser Ala Thr Ala Leu Asp Thr Asp Arg Pro Thr Thr Leu Gly Ser Val
500 505 510

act gct gtg ggc gat gaa aga gtg acc tcg acc aag aag gtt tcg agt 2484
Thr Ala Val Gly Asp Glu Arg Val Thr Ser Thr Lys Lys Val Ser Ser
515 520 525

gtg aag aca agc act tcg tcc ggg tcc ggg tcc act tcg gag tcg tct 2532
Val Lys Thr Ser Thr Ser Ser Gly Ser Gly Ser Thr Ser Glu Ser Ser
530 535 540

acg tcc agt tcg cat tcc agc aat ggc cca agg aca gta ggc ttt agt 2580
Thr Ser Ser Ser His Ser Ser Asn Gly Pro Arg Thr Val Gly Phe Ser
545 550 555 560

ttg tgt gcc gtt ttg tgc gca ttc ttg att tct ata cta gtt gtt tgc 2628
Leu Cys Ala Val Leu Cys Ala Phe Leu Ile Ser Ile Leu Val Val Cys
565 570 575

ta gatctgaagt tctaaggggc tttagtcttc atttatgatt tttttttatt 2680 

tggaccgcct cgaattgttt ttccgacggg tctactttaa agctgcaaga tctcgtttag 2740

cgtcgtttat ttctcgttcg tttagtgaca aaaaaacaga aaaaaaaact ataaaaagcg 2800

gtatataacc tttatatttt gataaacatg agcagcgaaa ttaagctagc accaaaggat 2860

tacgagaagg acaaggagtt cgccaaggct ctgcatggca aggacgccgc gagcgctaca 2920

ggaatgagtg cttgggtgaa gaaggacaag gaagctcaaa aagtcgcgat ggaaggatat 2980

ttcaagcact gggacgggaa aaccgacgag gagactgaaa agtcgagact cgaggactac 3040

tcgacgctca ccaagcacta ctacaacctg gtgacggatt tctacgagta tggatgggga 3100

tcctcgttcc acttttccag atactacaag ggagagccat ttagacaagc t 3151



<210> 2 

<211> 576

<212> PRT 

<213> Hansenula polymorpha 
<400> 2

Met Lys Val Ala Thr Leu Phe Phe Leu Ala Ser Ser Val Cys Val Leu
1 5 10 15

Gly Asp Pro Gln Phe Val Lys Leu Glu Ala Ser Val Leu Arg Gly Ser
20 25 30

Thr Tyr Lys Asp Ser Gln Lys Gly Ala Lys Pro Phe Met Leu Glu Lys
35 40 45

Arg Ala Asp Asp Gly Ser Val Thr Met Glu Leu Gln Asn Ala Gln Ser
50 55 60

Phe Tyr Gln Val Glu Ile Glu Ile Gly Ser Asp Lys Gln Lys Val Gly
65 70 75 80

Val Leu Ile Asp Thr Gly Ser Ser Asp Leu Trp Val Met Asn Ser Asn
85 90 95

Asn Ser Tyr Cys Ser Ser Ser Ser Thr Lys Lys Leu Lys Arg Asp Gly
100 105 110

Pro Ala Asp Ala Leu Gln Lys Gly Arg Asp Leu Ser Asp Leu Tyr Asn
115 120 125

Phe Asn Ser Pro Asn Glu Asp Asn Asn Ala Lys Gly Phe Leu Gly Gly
130 135 140

Trp Gly Asp Leu Thr Thr Val Glu Thr Ala Thr Gln Asp Glu Thr Gln
145 150 155 160

Thr Ala Leu Ala Ala Gln Ala Thr Val Asp Cys Ser Leu Tyr Gly Thr
165 170 175

Phe Asn Pro Ser Thr Ser Asn Ser Phe His Asn Asn Gly Thr Thr Phe
180 185 190

Glu Ile Ser Tyr Ala Asp Arg Thr Phe Ala Arg Gly Thr Trp Gly Tyr
195 200 205




Asp Asp Val Thr Phe Asn Gly Val Thr Val Asn Asp Leu Ser Leu Ala
210 215 220

Val Ala Asp Glu Thr Asp Ser Ser Thr Gly Val Phe Gly Ile Gly Leu
225 230 235 240

Arg Glu Leu Glu Thr Thr Tyr Ser Gly Gly Gly Pro Gln His Tyr Ile
245 250 255

Tyr Asp Asn Leu Pro Phe Lys Met Val Asp Gln Gly Leu Ile Asn Arg
260 265 270

Ala Ala Tyr Ser Val Tyr Leu Asn Ser Thr Glu Ser Ser Thr Ala Ser
275 280 285

Ile Leu Phe Gly Ala Val Asp Gln Ser Lys Tyr Thr Gly Ser Leu Gly
290 295 300

Leu Leu Pro Ile Ile Asn Thr Ala Ala Ser Tyr Gly Tyr Gln Lys Pro
305 310 315 320

Leu Arg Leu Gln Ile Thr Leu Ser Ala Ile Thr Val Ser Asp Ser Arg
325 330 335

Gly Gln Gln Ala Ser Ile Gly Ser Gly Ala Ala Ala Ala Leu Leu Asp
340 345 350

Thr Gly Thr Thr Leu Thr Tyr Ala Pro Ser Glu Ile Val Glu Lys Leu
355 360 365

Ala Glu Thr Leu Gly Phe Asp Tyr Ser Ser Ser Val Gly Ala Tyr Val
370 375 380

Ala Arg Cys Arg Asp Val Asp Ser Tyr Ala Val Asn Phe Asp Phe Gln
385 390 395 400

Gly Lys Val Ile Glu Ala Pro Leu Ser Ser Phe Leu Ile Ala Leu Gln
405 410 415

Thr Asn Ser Gly Glu Val Ser Ser Tyr Cys Ala Leu Gly Ile Phe Ser
420 425 430

Ser Gly Asp Glu Ser Phe Thr Leu Gly Asp Thr Phe Leu Arg Asn Ala
435 440 445

Tyr Phe Val Ala Asp Leu Glu Gly Tyr Gln Ile Ala Ile Ala Asn Val
450 455 460

Asn Leu Asn Pro Gly Ala Glu Gln Ile Glu Val Ile Ser Gly Asn Ser
465 470 475 480

Ile Pro Ser Ala Ser Ser Val Ser Asp Tyr Ser Asn Thr Trp Gly Ala
485 490 495

Ser Ala Thr Ala Leu Asp Thr Asp Arg Pro Thr Thr Leu Gly Ser Val
500 505 510

Thr Ala Val Gly Asp Glu Arg Val Thr Ser Thr Lys Lys Val Ser Ser
515 520 525

Val Lys Thr Ser Thr Ser Ser Gly Ser Gly Ser Thr Ser Glu Ser Ser
530 535 540

Thr Ser Ser Ser His Ser Ser Asn Gly Pro Arg Thr Val Gly Phe Ser
545 550 555 560

Leu Cys Ala Val Leu Cys Ala Phe Leu Ile Ser Ile Leu Val Val Cys
565 570 575



<210> 3
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 3
gaagtgcagc agcagctcct gaacc 25

<210> 4
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 4
ggctgatgac ggctcggtca cgatgg 26

<210> 5
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 5
ggacacgcaa gaggtgtctg 20

<210> 6
<211> 40
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 6
agctcgctac ccggggatcc gcaactttca ttgtgtcaac 40

<210> 7
<211> 40
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 7
gcacatcccc ctttcgccag cctcttcggt gcggttgacc 40

<210> 8
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 8
gctcggctcc aggattcagg 20

<210> 9
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 9
ggatccccgg gtaccgagct 20

<210> 10
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 10
caccggtagc taatgatccc 20

<210> 11
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 11
cgaacatcca agtgggccga 20

<210> 12
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 12
ctggcgaaag ggggatgtgc 20


<210> 13
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 13
gaattcatga agtgggtaac cttt 24

<210> 14
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 14
taagcctaag gcagcttgac 20

<210> 15
<211> 36
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 15
caagctgcct taggcttatg cagctgctcc ccggtg 36

<210> 16
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> primer

<400> 16
actagtgatt tatgggtcct cgatg 25